BaseNPs that contain gene names: domain specificity and genericity
نویسنده
چکیده
The names of named entities very often occur as constituents of larger noun phrases which denote different types of entity. Understanding the structure of the embedding phrase can be an enormously beneficial first step to enhancing whatever processing is intended to follow the named entity recognition in the first place. In this paper, we examine the integration of general purpose linguistic processors together with domain specific named entity recognition in order to carry out the task of baseNP detection. We report a best F-score of 87.17% on this task. We also report an inter-annotator agreement score of 98.8 Kappa on the task of baseNP annotation of a new data set.
منابع مشابه
Cloning and Expression of Recombinant Camelid Single-Domain Antibody in Tobacco
Antibodies provide a suitable tool in fundamental research and their high affinity and specificity make them invaluable for diagnostic and therapeutic applications. A promising alternative to conventional antibodies are the heavy chain antibodies (VHH) of Camelidae having short length, high solubility and stability are preferred to other antibody derivatives. In this study, our goal was product...
متن کاملA Quasi-Dependency Model for Structural Analysis of Chinese BaseNPs
The paper puts forward a quasidependency model for structural analysis of Chinese baseNPs and a MDL-based algorithm for quasidependency-strength acquisition. The experiments show that the proposed model is more suitable for Chinese baseNP analysis and the proposed MDLbased algorithm is superior to the traditional MLbased algorithm. The paper also discusses the problem of incorporating the lingu...
متن کاملADAM Gene Expression in The Adult CNS and Genetic Aberrations in Cancer Cells
ADAM metalloprotease-disintegrins share a common modular structure of functional domains for proteolytic, cell adhesion, and signaling interactions. The metalloprotease domain of oughly half of the known ADAMs contain an intact consensus metzincin catalytic site, and they are thus thought to function as active metalloproteases. The types of interactions mediated by ADAMs are expressly conspicu...
متن کاملIncreasing Software Reliability through Use of Genericity
In our opinion, methods for construction of reliable software are of great importance. Reliability engineering must start in the earliest phases of a software project, and it has to consider not only the architectural level, but should be in the mind of humans even earlier when analyzing the problem space. In a previous paper, we argued that reduction of redundancy of software is a central fact...
متن کاملP-85: How a Frame Shift Caused by a Single Base Deletion In SEPT12 Gene Shed Lights As a Polymorphism
Background: Septins are members of highly conserved polymerizing GTP binding proteins well described in the animal kingdom. 14 Septin proteins have been characterized in humans (SEPT1-SEPT14), some of which are tissue-specific. All of 14 genome-mapped human septins contain a highly conserved central GTP-binding domain which is very critical in GTPase signaling properties as well as oligomerizat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007